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The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 



form: 



Application Serial Number: 04 /f/9 £b 



■ i 



Source: 



Date Processed by STIC: 



#9 3d ZLdd/- 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

pipaw FORWARJDTTHIS INFORMATION TO THE APPLICANT BY EITHER: 

^^^^^^^nONTOVT IN YOUR NEXT COMMUNICATION TO THE 

APPI ICANT WITH A NOTICE TO COMPLY or, 
2) TCLEMWNDW APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 

FOR^sSlS^QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE INTACT ROBERT WAX 703-308-4216. 
PATENTIN 2.1 c-mail help: rstin7lhHn@usnto.eov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 c-mail help: patin3hdn@usnto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
W.S 3 0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0. works for sequence listings 
generated for the original version of 37 CFR §§1 821 - 1.825 effective October I 1990 (old 
hiles) and the revised version (new rules) effective July I, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTOV 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 

Choker Version 30 ran he down Inan^ri from the USPTO w eb site at the following address: 

http://www.uspto.gov/web/offices/pac/ch£Cker' 



»4 



Raw Sequence Listing Error Summary 



.** 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA** HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



I Wrapped Nucleic* 

Wrapped Aminos 



S 



10 



11 



12 



The numbcrAext at the end of each line "wrapped** down to the next line. Jhis may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to 3; this will 
prevent ^wrapping.** 



Invalid Line Length The rules require that a line not csceed 72 characters in length, this includes white spaces. 



3 Misaligned Amino 

Numbering 



Non-ASCII 




Variable Length - 



Patentln 2.0 
"bug" 



The numbering under each 3* amino acid is. misaligned. Do not use tab codes between numbers; 
use space characters, instead. < ' 

The submitted file was not saved in ASCU(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved In ASCII text. 

Sequencers ) Q contain n*s or Xaa>s*representing more than, one residue. Per Sequence Rules, 
each it o r Xaa can only rep res ent a aln g le residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patent In version 2.0 has closed fife <220>*<223> section to be missing from amino acid 
sequencers) 



. Normally. Patent In i would'automaticslfxg'cncrate this section from the 
previously coded nucleic acid sequence. Please manually copy th« relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to (he mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Skipped Sequences Sequences) missing. If intentional, please insert the following lines for eaeh skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do npt insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION :SEQ ID NO.X: (insert SEQ ID.NO where "X" is shown) 
This sequence is intentionally skipped 

* 

Please also adjust the M (ii) NUMBER OF SEQUENCES:" response to Include the skipped sequences. 



(OLD RULES) 



Skipped Sequences 
"(NEW RULES) 



Use of n't or Xaa's 
"(NEW RULES) 



Invalid <213> 
Response 



Sequencers) 



missing. If Intentional, please insert the following lines for each skipped sequence. 



Use of<220> 



<2 1 0> sequence id number 
<400> sequence id number 
000 

Use of n*s and/or Xaa't have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa*s arc present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <2I3> responses are: Unknown, Artificial Sequence, or 
sctentiTtc'namc (Genus/spocics). <220>-<223> section is required when <2 13> response is Unknown or 
is Artificial Sequence 

missing the <220> "Feature" and associated numeric identifiers and responses. 



Patentln 2.0 
* "bug" 



Sequencers) 

Use of <220> to <223> is MANDATORY if <2I3> "Organism" response is "Artificial Sequence" or 

"UnJcnown," Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instczd, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 



Misuse of n 



ri can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 
any value not specifically a nucleotide: 



£e type of erro^shown^t throughout 



AMC/MK - Biotechnology Systems Branch - 08/21/2001 



r* * 



* +- ■ _ '■ 



_ - ^ J p age l of 7 

* 

PCT09 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/719,261 



DATE: 08/30/2001 
TIME:"^09:46:14 





4 




5 




6 




7 




8 




9 




11 




13 




15 


c--> 


16 




18 




19 




21 




22 




24 




26 




29 




30 




31 




32 




34 




35 




36 




37 




39 




40 




41 




42 




44 




45 




46 




47 




49 




50 




51 




53 


w--> 


54 




55 




59 




60 




61 




62 




64 




65 




66 




67 




69 



Input Set : A:\Mewburn-Matassa ('261j Sequence Listing.txt 
Output Set: N:\CRF3\08302001\l719261.raw 



<110> APPLICANT: Istit.uto 
Matassa, Victor 
Narjes, Frank 
Koehler, Konrad 
Ontoria, Jesus 
Poma, Marco 



Di Ricerche Di Biologia Molecolare P. Angeletti SpA 

- 1 

Daet Nat @§fflBty 

Correeted Diskette Hm&%$ 



C virus NS3 protease 



SEQ ID NOS: 13 
Patentln Ver. 2 
1 



.phenylj^ajiine^ 



<130> FILE REFERENCE: KMN/FP5780044 
<140> CURRENT APPLICATION NUMBER: 09/719,261 
<141> CURRENT FILING DATE: 2001-07-23 
<150> PRIOR APPLICATION NUMBER: PCT/GB99/01824 
<151> PRIOR FILING DATE: 1999-06-09 
<150> PRIOR APPLICATION NUMBER: GB 9812523.0 
<151> PRIOR FILING DATE: 1998-06-10 
<160> NUMBER OF 
<170> SOFTWARE: 
<210> SEQ ID NO 
<211> LENGTH: 4 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
FEATURE : 
NAME/KEY: SITE 
LOCATION: (1) 
OTHER INFORMATION: Xaa is 
FEATURE : 
<221> NAME/KEY: SITE 
<222> LOCATION: (3) 

OTHER INFORMATION: Xaa is 
FEATURE: 
NAME/KEY: SITE 
<222> LOCATION: (4) 

<223> OTHER INFORMATION: Xaa is 4 , 4 -dif luoro-2 -am ino but yric acid 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

sequence s 
<400^SEQUEJKJE: A. 
Xaa/Glu Xaa Xatf 
1 

<210> SEQ ID NO: 2 
<211> LENGTH: 6 
<212> TYPE: PRT 



NAME/KEY: SITE 
LOCATION : ( 6 ) 
OTHER INFORMATION: 
FEATURE : 



<220> 
<221> 
<222> 
<223> 
<220> 



<223> 
<220> 
<221> 



cyclohexylalanine^ 



<223> 
<220> 



Xaa is 4 , 4-dif luoro-2 -amino* butyric acid 



file://C:\CRF3\Outhold\VsrI71 9261 .htm 



8/30/01 



RAW SEQUENCE LISTING DATE: 08/30/2001 

PATENT APPLICATION: US/09/719 , 261 TIME: 09:46:14 



Input Set : A:\Mewburn-Matassa ('261) Sequence Listing.txt 
Output Set: N:\CRF3\08302001\I719261.raw 

70 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

71 sequence , \ 
-73 <400> SEQUENCE: 2 

W--> 74 Asp Glu Met Glu Glu Xaa 
75 1 5 

78 <210> SEQ ID NO: 3 - 

79 <211> LENGTH: 6 

80 <212> TYPE: PRT 

81 <213> ORGANISM: Artificial Sequence 

83 <220> FEATURE: 

84 <221> NAME/KEY: SITE A * 

85 <222> LOCATION: (3) , .. 

86 <223> OTHER INFORMATION: Xaa is diphenylalanine 

88 <220> FEATURE: 

89 <2 2 1> NAME/KEY: SITE 

90 <222> LOCATION: (5) 

91 <223> OTHER INFORMATION: Xaa is cyclohexylalanine 

93 <220> FEATURE: 

94 <221> NAME/KEY: SITE 

95 <222> LOCATION: (6) 

96 <223> OTHER INFORMATION: Xaa is 4 , 4 -dif luoro- 2 -amino butyric acid 

98 <220> FEATURE: 

99 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

100 sequence 
102 <400> SEQUENCE: 3 

W--> 103 Asp Glu Xaa Glu Xa; 
104 1 5 

107 <210> SEQ ID NO: 4 

108 <211> LENGTH: 6 

109 <212> TYPE: PRT 

110 <213> ORGANISM: Artificial Sequence 

112 <220> FEATURE: 

113 <221> NAME/KEY: SITE 

114 <222> LOCATION: (3) 

115 <223> OTHER INFORMATION: Xaa is diphenylalanine 

117 <220> FEATURE: 

118 <221> NAME/KEY: SITE 

119 <222> LOCATION: (5) 

120 <22 3> OTHER INFORMATION: Xaa is cyclohexylalanine 

122 <220> FEATURE: 

123 <221> NAME/KEY: SITE 

124 <222> LOCATION: (6) 

125 <223> OTHER INFORMATION: Xaa is a fluorinated hydrocarbon side chain 

127 <220> FEATURE: 

128 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

129 sequence^ 
131 <400> SEQUENCE: 4 

W--> 132 Asp Glu Xaa Glu xia^ Xa^ 
133 1 5 





fiIe://C:\CRF3\Outhold\VsrI71 926 1 .htm 




RAW SEQUENCE LISTING DATE : 08/30/2001 

PATENT APPLICATION: US/09/719,261 TIME: 09:46:14 

Input Set : A:\Mewburn-Matassa ('261) Sequence Listing. txt 
Output Set: N:\CRF3\08302001\I719261.raw 

136 <210> SEQ ID NO: 5. 

137 <211> LENGTH: 6 
- 138 <212> TYPE: PRT 

139 <213> ORGANISM: Artificial Sequence 

141 <220> FEATURE: . j 

142 <221> NAME/KEY: SITE w j re t?re$e*i <^ sifl<\H 

143 <222> LOCATION: (6) fi** WA 7 ^H" 1 \ \ 

144 <223> OTHER INFORMATION: Xaa is a fluorinated hydrocarbo n side chain 

146 <220> FEATURE: 

147 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

148 sequence - a* 
150 <400> SEQUENCE: 5 

W--> 151 Asp Glu Met Glu Glu Xaa 
152 1 5 

155 <210> SEQ ID NO: 6 

156 <211> LENGTH: 5 

157 <212> TYPE: PRT 

158 <213> ORGANISM: Artificial Sequence 

160 <220> FEATURE: 

161 <221> NAME/KEY: SITE 

162 <222> LOCATION: (1) 

163 <223> OTHER INFORMATION: Asp as tertiary butyl ester 

165 <220> FEATURE: 

166 <221> NAME/KEY: SITE 

167 <222> LOCATION: (2, 4, 5) 

168 <223> OTHER INFORMATION: Glu as tertiary butyl ester 

170 <220> FEATURE: 

171 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

172 sequence 

174 <400> SEQUENCE: 6 

175 Asp Glu Met Glu Glu 

176 1 5 

179 <210> SEQ ID NO: 7 

180 <211> LENGTH: 5 

181 <212> TYPE: PRT 

182 <213> ORGANISM: Artificial Sequence 

184 <220> FEATURE: 

185 <2 2 1> NAME/KEY: SITE 

186 <222> LOCATION: (1) 

187 <223> OTHER INFORMATION: Asp as tertiary butyl ester. 

190 <2 20> FEATURE: 

191 <221> NAME/KEY: SITE 

192 <222> LOCATION: (2, 4) 

193 <223> OTHER INFORMATION: Glu as tertiary butyl ester 

195 <220> FEATURE: 

196 <221> NAME/KEY: SITE 

197 <222> LOCATION: (3) 

198 <223> OTHER INFORMATION: Xaa is diphenylalanine 
200 <220> FEATURE: 



may 

The type of errors showiuuast throughout 
the Sequence Usting. Pl&se check subsequent 
sequences for similar errors. 



file://C:\CRF3\Outhold\VsrI71926 1 .htm 



RAW SEQUENCE LISTING DATE: 08/30/2001 

PATENT APPLICATION: US/09/719,261 TIME: 09:46:14 

Input Set : A:\Mewburn-Matassa ('261) Sequence Listlng.txt 
Output Set: Ni\CRP3\08302001\l719261.raw 

201 <221> NAME/KEY: SITE 

202 <222> JJOCATION: (5) 

- 203 <223> OTHER INFORMATION: Xaa is cyclohexylalanine 

205 <220> FEATURE: 

206 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

207 sequence 

209 <400> SEQUENCE: 7 ^ \ ' 

W--> 210 Asp Glu Xaa Glu Xaa 
211 1 5 

214 <210> SEQ ID NO: 8 

215 <211> LENGTH: 17 ' ' 

216 <212> TYPE: PRT v..-- 

217 <213> ORGANISM: Artificial Sequence 

219 <220> FEATURE: 

220 <2 2 1> NAME/KEY: MOD_RES 

221 <222> LOCATION: (17) 

222 <223> OTHER INFORMATION: AMI DAT I ON 

224 <220> FEATURE: 

225 <223> OTHER INFORMATION: Description of Artificial Sequence': Synthetic 
22 6 sequence. 

228 <4 00> SEQUENCE: 8 

229 Lys Lys Lys Gly Ser Val Val lie Val Gly Arg He He Leu Ser Gly 

230 1 5 10 15 
232 Arg 

235 <210> SEQ ID NO: 9 

236 <211> LENGTH: 13 

237 <212> TYPE: PRT 

238 <213> ORGANISM: Artificial Sequence 

240 <220> FEATURE: 

241 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

242 sequence 

24 4 <4 00> SEQUENCE: 9 

245 Asp Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr Lys 

246 1 5 10 
249 <210> SEQ ID NO: 10 

2 50 <211> LENGTH: 4 

251 <212> TYPE: PRT . 

252 <213>. ORGANISM: Artificial Sequence 

254 <220> FEATURE: 

255 <221> NAME/KEY: SITE 

256 <222> LOCATION: (1)..{3) 

257 <223> OTHER INFORMATION: Phenyalanines are linked by an ether bond 

259 <220> FEATURE: 

260 <221> NAME/KEY: SITE 

261 <222> LOCATION: (4) 

262 <223> OTHER INFORMATION: Xaa is 4 , 4 -dif luoro-2-amino butyric acid 

264 <220> FEATURE: 

265 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

266 sequence 




file://C:\CRF3\Ou thold\VsrI7 1 926 1 .htm 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/719,261 



DATE: 
TIME: 



08/30/2001 
09:46:14 



Input Set : A:\Mewburn-Matassa ('261) Sequence Listing.txt 
Output Set: N:\CRF3\08302001\I719261.raw 



W-- 



W--> 



w--> 



268 
> 269 

-270 
273 
274 
275 
276 
278 
279 
280 
281 
283 
284 
285 
286 
288 
289 
290 
291 
293 
294 
295 
297 
298 
299 
302 
303 
304 
305 
307 
308 
309 
310 
312 
313 
314 
315 
317 
318 
319 
320 
322 
323 
324 
326 
327 
328 
331 
332 



Xaa is diphenylalafrine 



<400> SEQUENCE : 10. 
Phe Glu Phe Xaa 
1 

<210> SEQ ID NO: 11 
<211> LENGTH: 6 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 
<221> NAME/KEY: SITE 
<222> LOCATION: (3) 
<223> OTHER INFORMATION 
<220> FEATURE: 
<221> NAME/KEY: SITE 
<222> LOCATION: (5) 

<223> OTHER INFORMATION: Xaa is cyclohexylalanine 
<220> FEATURE: 
<221> NAME/KEY: SITE 
<222> LOCATION: (6) 
<223> OTHER INFORMATION: Xaa is 
<220> FEATURE: 
<223> OTHER INFORMATION: 

sequence 
<400> SEQUENCE: 11 
Asp Glu Xaa Glu Xaa Xaa 



3 -amino- 5, 5-dif luoro-pentanoic acid 
Description of Artificial Sequence: Synthetic 



1 

<210> 
<211> 
<212> 
<213> 
<220> 



5 
12 



SEQ ID NO: 
LENGTH: 5 
TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 
<221> NAME/KEY: SITE 
<222> LOCATION: (2) 

<223> OTHER INFORMATION: Xaa is diphenylalanine 
<220> FEATURE: 
<221> NAME/KSY: SITE 
<222> LOCATION: (4 ) 

<223> OTHER INFORMATION: Xaa is cyclohexylalanine 
<220> . FEATURE : 
<221> NAME/KEY: SITE 
<222> LOCATION: (5) 
<223> OTHER INFORMATION 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

sequence 
<400> SEQUENCE: 12 
Glu Xaa lie Xaa Xaa 
1 5 
<210> SEQ ID NO: 13 
<211> LENGTH: 6 



Xaa is 4 , 4-dif luoro- 2 -amino butyric acid 



file://C:\CRF3\Outhold\VsrI7 1 926 1 .htm 



VERIFICATION SUMMARY DATE: 08/30/2001 

PATENT APPLICATION: US/09/719,261 TIME: 09:46:15 



Input Set : A:\Kewburn-Matassa ('261) Sequence Listing. tact 
Output Set: Ni\CRF3\08302001\I719261.raw 



L:16 M:271 C: 
L:54 M:341 W: 
L:74-M:341 W: 



Current Filing Date differs. Replaced Current Filing bate 



L:103 
L:132 
L:151 
L:210 
L:269 
L:298 
L:327 
L:356 



M:341 
M:341 
M:341 
M:341 
M:341 
M: 341 
M:341 
M:341 



W 
W 
W 
W 

w 
w 
w 
w 



(46) 
(46) 
46 
46 
46 
46 
46 
46 
46 
46 



"n" 
"n" 



or 
or 



"n" 

"n" 
"n" 

"n" 
"n" 



or 
or 
or 
or 
or 
or 
or 
or 



"Xaa" 
"Xaa" 
"Xaa" 
"Xaa" 
"Xaa" 
"Xaa" 
"Xaa" 
"Xaa" 
"Xaa" 
"Xaa" 



used/ for SEQ ID# : 1 
used/ for SEQ ID#:2 
used, for SEQ ID# : 3 
used, for SEQ ID#:4 
used, for SEQ ID# : 5 
used, for SEQ ID# :7 
used, for SEQ ID#:10 
used, for SEQ ID#:11 
used, for SEQ ID#:12 
used, for SEQ ID#:15* 
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